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THE  RANK  INPUT  METHOD  ANO  PROBABILITY  VARIATION  GUIDES 


1.  INTRODUCTION 

The  author  has  presented  numerous  training  sessions  on  making  subjective  probability  forecasts. 
From  this  experience  he  finds  a  forcaster  can  make  good  reliable  subjective  forecasts.  Given 
standard  weather  information,  all  that  is  additionally  needed  is  the  basic  knowledge  of  probability, 
practice  training,  the  time  to  formulate  a  forecast,  and  prompt  verification  feedback.  However, 
urgent  military  requirements  sometimes  leave  little  time  for  training  for  new  locations  or 
categories.  Also,  It  Is  straightforward  to  subjectively  forecast  for  two  categories,  but  it  is  time 
consuming  to  determine  probabilities  for  many  categories.  For  example,  the  TEAM  SPIRIT  exercise  in 
Korea  in  1977  required  probabilities  for  six  ceiling  categories.  The  rank  input  method  (RIM)  and 
probability  variation  guides  (PVG)  have  been  developed  to  solve  this  problem. 

The  RIM  and  PVG  assume  that  a  forecaster  with  training  and  experience  can  generate  the  degree  of 
certainty  that  a  synoptic  situation  favors  a  specific  event.  Ordinarily,  additional  training  and  in 
particular  verification  feedback,  is  necessary  to  convert  the  degree  of  certainty  into  a  reliable 
probability  forecast.  The  basic  idea  of  the  RIM  is  to  use  an  objective  procedure  to  convert  the 
subjective  degree  of  certainty  into  a  reliable  probability  forecast  of  an  event.  A  PVG  is  a  table 
of  RIM  output  for  manual  use. 

The  forecaster's  input  can  be  a  rank  input,  a  single  category  input,  or  a  categorical  forecast. 
These  inputs  are  effectively  scaled  by  the  climatological  frequency  of  the  event  and  the  forecaster 
skill.  To  understand  the  equations  involved,  some  knowledge  of  the  Transnormalized  Regression 
Probability  Forecast  Model  (TRP)  is  needed. 

2.  TRANSNORMALIZED  REGRESSION  PROBABILITY 

The  TRP  model  Is  a  statistical  model  that  can  be  used  to  produce  probability  forecasts  in  an 
empirical  fashion  such  as  regression  estimate  of  event  probabilities  (REEP)  and  multiple 
discriminant  analysis  (MDA).  However,  TRP  differs  in  two  respects.  First,  TRP  distinguishes 
between  direct  predictors  which  are  generated  by  nature  and  have  inherent  distribution  and  marginal, 
predictors  such  as  location  or  verification  time  which  are  selected  by  man  and  have  no  inherent 
distribution.  The  term  marginal  comes  from  the  fact  that  these  predictors  adjust  the  distribution 
of  the  direct  predictors  which  are  considered  marginal  distributions  of  the  multivariate  normal. 

The  second  major  difference  is  the  analytical  nature  of  the  TRP  model.  This  analytical  nature 
allows  for  a  priori  verification,  forecast  distributions,  and  modeling  of  correlation  and 
climatology.  The  TRP  model  consists  of  three  basic  parts:  trans normalization,  correlation,  and 
multivariate  normal  probability.  Details  of  the  TRP  model  are  in  Boehm  (1976). 

2.1  Transnormalization 


Transnormalization  is  the  process  of  transforming  a  variable  so  that  it  is  normally  distributed. 
Variables  that  have  been  transnormalized  are  called  equivalent  normal  deviates  (END)  and  are 
designated  with  a  quote  (")  above  the  variable.  A  transnormalization  procedure  can  be  developed 
from  climatology  rather  than  the  small  concurrent  predictor/predictand  data  base  usually  available. 
Having  each  variable  normally  distributed  is  a  necessary  (but  not  sufficient)  condition  for  a 
multivariate  normal  distribution.  The  assumption  of  multivariate  normality,  given  that  each 
variable  is  individually  normally  distributed,  is  the  basic  and  only  assumption  of  the  TRP  model. 

2.2  Correlation 


The  Pearson  product  moment  coefficient  of  correlation  is  frequently  used  of  calculate 
correlation  between  two  variables;  however,  this  procedure  can  give  a  highly  biased  estimate  when 
one  or  both  of  the  variables  have  been  categorized.  The  mutivariate  normal  conditional  probability 
equation  requires  the  correlation  between  the  continuous  underlying  variables  before  they  have  been 
categorized,  not  the  correlation  corresponding  to  the  linear  least  squares  fit  between  categorized 
variables.  Pearson  developed  many  other  formulas  to  estimate  the  correlation  between  underlying 
variables  given  only  the  categorized  variables.  Tetrachoric  correlation  Is  used  for  correlating 
binary  against  binary.  However,  beware  of  several  simple  formulas  that  give  rough  est mates:  a  more 
exact  estimate  requires  a  series  expansion.  Biserial  correlation  is  used  tften  one  variate  is 
continuous  and  the  other  binary.  The  original  formula  developed  by  Pearson  (and  given  in  most 
statistical  texts)  Is  unstable;  an  interactive  scheme  has  been  developed  by  Tate  (1955). 
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The  ailtlnormal  conditional  probability  of  the  predict  and  (Y)  being  less  than  a  threshold  o?) 
given  the  m  predictors  xi,  x?,...^  1$ 


Pr(Y  <y|xl§  . . 5*)  •  ♦  [(y  -  RN)/(1  - 


(i) 


vbere  N  equals  («xxl  ♦  «2X2  +••••»*****)  *1  are  regression  coefficients,  and  R  Is  the  correlation. 
The  cumulative  normal  distribution,  i(u),  can  be  approximated  (absolute  error  less  than  0.0004)  by 
Muench's  (Touart,  1973)  formula 


$ (u)  =  [1  +  Tanh(0.8u  +  0.035u^)]/2 


(2) 


Gringorten  (1978)  extended  the  equation  for  multiple  predlctands. 

Given  the  assumption  of  multinormality,  N  Is  normally  distributed  with  a  mean  of  zero  and  a 
variance  of  one.  Furthermore,  the  normality  of  N  Is  quite  robust  since  the  sum  of  many  random 
variables  will  tend  to  be  normally  distributed  even  If  the  individual  variables  are  not  normally 
distributed.  The  fact  that  the  distribution  of  N  Is  known  Is  exploited  In  the  rank  Input  method. 

3.  RANK  INPUT  METHOD 

Subjective  Input  may  be  a  rank  input,  a  single  probability  (to  produce  multiple  probabilities), 
or  a  categorical  input*  Each  of  these  inputs  is  transformed  to  the  equivalent  value  of  N  aqd  then 
Equation  (1)  is  used  to  produce  probabilities  for  specific  categories  by  changing  y,  the 
climatological  value  of  the  predictand.  The  method  requires  the  climatology  of  the  predictand  and 
the  forecaster's  skill  measured  by  the  correlation  R.  For  celling/visibility  forecasts,  R  Is 
approximately 


R  *  0.98t  (T  Is  In  hours)  (3) 


This  model  was  fitted  with  data  from  Touart  (1973)  and  tested  with  several  thousand  3-,  6-,  12-,  and 
24-hour  Air  Weather  Service  Forecasts. 

3.1  The  Rank  Input 

With  this  procedure,  the  forecaster  first  ranks  the  synoptic  situation  with  respect  to  the 
element  being  forecast.  For  example,  with  a  warm  front  moving  Into  the  area  of  Interest,  the 
forecaster  may  decide  that  this  Is  the  90  percent  worst  situation  with  respect  to  low  ceilings. 
Here  the  term  synoptic  situation  is  taken  to  Include  the  forecast  position  of  synoptic  features  as 
well  as  the  current  position.  Indeed,  the  forecaster  needs  to  include  all  information  (e.g.,  output 
from  an  objective  forecast  study)  In  Ms  determination  of  the  ranking.  Note  that  during  a  long 
period.  Including  normal  weather  fluctuation,  rank  Inputs  should  be  uniform.  This  Is,  the  number  of 
times  the  forecaster  selects  a  rank  0-10  percent  should  equal  (within  sampling  error)  the  number 
between  10-20  percent,  or  20-30  percent,  etc. 

The  rank  input  (w)  is  expressed  as  a  fraction,  e.g.,  0.01  worst  to  0.99  best.  This  fraction  is 
converted  to  an  END  via  the  Inverse  cumulative  normal  which  can  be  approximated  (less  than  0.05 
error  for  w  from  0.001  to  0.999)  by  a  formula  developed  by  Joiner  and  Rosenblatt  (1971): 


w  *  [w0*14  -  (l  -  #)0.14]  4%9i 


(4) 


The  N  In  Equation  (1)  Is  set  to  minus  w  which  makes  N  the  END  of  the  cumulative  best  versus  the 
cumulative  worst.  What  Is  worst  or  best  depends  on  a  particular  application.  Thus,  a  double  check 
on  the  sign  of  N  Is  wise.  An  example  of  RIM  using  the  rank  Input  Is  given  In  Table  1. 
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Table  1.  Example  of  a  Probability  Variation  Guide  Using  the  Rank  Input  Method. 
Rank  printed  here  as  percentage  was  converted  to  an  END  by  Eq.  (4),  then  Eq.  (1) 
was  used  to  calculate  probabi  1  ities .  Correlation  was  obtained  from  Eq.  (3). 
Climatology  of  the  four  categories  was:  0.03,  0.106,  0.207,  and  0.685. 
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3*2  The  Single  Probability  Input 


Forecast  probabilities  for  multiple  ranked  categories  can  be  obtained  from  the  forecast 
probability  of  a  single  category  by  solving  for  N  in  Equation  (1) 


n  «  Cy  «  p  (i-R2) 1/2]/r 

ii 

where  y  is  the  END  of  the  climatological  probability  of  the  Input  category,  and  P  is  the  forecast 
probability  of  the  Input  category.  The  calculated  N  Is  then  used  In  Equation  (^)  to  produce 
forecast  probability  for  each  of  the  other  categories  by  substituting  the  appropriate  y.  An  example 
of  a  RIM  using  a  single  probability  Input  Is  given  In  Table  2. 


Table  2.  Example  of  a  Probability  Variation  Guide  Using  a  Single  Cate¬ 
gory  Input.  Correlation  is  from  Eq.  (3).  Climatology  values  are: 
input  0.3,  predlctands  0.01,  0.05,  0.1,  0.3,  0.54. 
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A  categorical  forecast  (e.g.,  a  400-ft  ceiling)  can  be  converted  to  a  probability  forecast 
provided  distribution  of  the  categorical  input  Is  known.  Two  forecast  distributions  often  occur. 

One  is  that  the  categorical  forecast  Is  distributed  the  same  as  the  observations.  For  example, 
Somerville,  Watkins,  and  Daley  (1978)  found  that  observed  sky  cover  could  be  fitted  with  a  Johnson's 
Bounded  distribution 


P  =  A  +  B  In  [n/(l-n)]  (6) 


it 

where  P  is  the  END  of  the  climatological  distribution.  A  and  B  are  parameters  depending  on 
location,  time  of  day,  and  time  of  year,  and  n  Is  the  fraction  of  sky  cover.  If  the  forecast  sky 
cover  and  observation  distributions  are  the  same,  then  the  N  in  Equation  (1)  can  be  equal  to  P  from 
Equation  (6)  given  the  categorical  forecast  of  the  n.  Table  3  is  an  example  of  a  RIM  using  a 
categorical  forecast  input. 

The  second  way  that  categorical  forecasts  are  likely  to  be  distributed  is  that  a  categorical 
forecast  is  the  result  of  a  root -mean -square  estimate.  In  this  case,  its  variance  will  be  less  than 
the  variance  of  the  observed  distribution  by  a  fraction  equal  to  the  correlation  squared.  The 
procedure  is  to  inflate  the  categorical  predictor  by  dividing  the  deviation  from  the  mean  by  the 
value  of  correlation  and  then  using  the  above  result. 


Table  3.  An  Example  of  a  Probability  Variation  Guide  Using 
a  Categorical  Input.  Climatology  from  Eq.  (6)  with  A  = 
0.339,  B  =  0.217.  CLR  defined  as  less  than  0.05  coverage; 
OVC  greater  than  0.95  coverage.  The  mid-point,  e.g.,  1/2 
PR(CLR),  was  used  as  the  predictor  probability  value  for 
CLR  and  QVC. 

PROBABILITY  OF  BEDFORD  0600  SKY  COVER 
USING  A  CATEGORICAL  30-H0UR  FORECAST 
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4.  PROBABILITY  VARIATION  GUIDES 

The  RIM  can  be  implemented  on  a  real-time  computer  system  or  even  on  one  of  the  smaller 
programmable  calculators.  However,  If  neither  of  these  are  available,  tables  of  probability 
forecasts  called  Probability  Variation  Guides  can  be  produced.  Each  location,  forecast  element, 
forecast  projection,  and  verifying  time  requires  a  separate  table.  The  tables  may  use  the  rank 
Input  (Table  1),  single  category  (Table  2),  or  categorical  input  (Table  3). 

The  tables  are  also  valuable  for  training  forecasters  and  for  use  by  planners  who  need  to  see 
the  expectancy  of.  various  probability  values. 

b.  COMPLETE  FORECAST  DISTRIBUTIONS 

When  the  climatology  of  a  variable  can  be  expressed  as  a  differentiable  function.  It  Is  possible 
to  generate  complete  forecast  distributions  with  N  as  the  only  variable.  To  obtain  such  a  result, 
the  y  In  Equation  (1)  Is  replaced  with  a  climatological  function  and  the  right-hand  side  of  Equation 
(1)  Is  differentiated  with  respect  to  the  predlctand  using  Leibniz's  theorem  for  differentiation  of 
an  Integral. 
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Consider,  for  example,  the  climatological  distribution  of  sky  cover  as  given  by  Equation  (b). 
Substituting  Equation  (6)  into  Equation  (1)  and  differentiating  gives 

BMAjg. Mn/ll-nH  -  RN)/ tl~«2 >1^2 >  (?) 

f(n)  ‘  (l-rty/z  n(l-n) 

where  0  is  the  ordinate  of  the  standard  normal  density,  and  f(n)  is  the  forecast  probability  density 
for  a  given  fraction  of  sky  cover  (n)  varying  only  with  the  single  parameter  N.  For  correlation 
equal  to  0,  the  climatological  probability  density  results 

=  n(l-n)  ♦  £A+B  ln[n/(l-n)]}  («) 

Typical  curves  of  Equations  (7)  and  (8)  are  shown  in  Figure  1. 


0  0.2  0.4  0.6  0.6  1.0 


Figure  1.  Fraction  of  Cloud  Cover  for  Bedford, 
November  0600.  Climatology  from  Eq.  (8)  with 
A  =  0.339,  B  =  0.217.  Forecast  densities  are 
from  Eq.  (7)  with  R  =  0.9B. 


6.  HANK  INPUT  AS  A  FORECASTER  MULTIPLIER 

Current  operational  forecasts  are  being  made  for  the  most  part  using  a  man-machine  mix  (Snellman 
1977).  The  computer  is  used  to  compute  large-scale  weather  changes  and  specific  forecasts  through 
objective  methods.  This  guidance  is  passed  to  the  forecaster  who,  using  experience,  "tunes"  the 
information  before  passing  It  to  the  customer.  Many  computer  programs  are  being  developed  to 
provide  the  forecaster  with  even  more  timely  and  varied  guidance.  The  system  works  and  is  in 
widespread  use.  However,  Snellman  points  out  some  danger  signs.  Forecasters  are  operating  more  as 
communicators  and  less  as  meteorologists.  The  system  encourages  the  forecasters  to  follow  guidelines 
blindly.  The  author  has  observed  situations  where  the  volume  of  guidance  has  inundated  the 
forecaster  who  then  has  no  time  to  properly  assimilate  the  information,  let  alone  determine  an 
optimal  tailored  forecast.  Therefore,  this  type  of  system  could  be  termed  a  bottleneck  system— with 
the  forecaster  as  the  bottleneck.  The  computer  is  driving  the  man  rather  than  vice  versa.  Given 
this  state  of  affairs  a  completely  automated  system  then  seems  to  be  very  practical. 

Consider  the  entire  man-machine  mix.  Computers  have  capability  of  errorless  high-speed 
computation,  storage,  and  formatting.  But  people  have  Initiative,  flexabillty,  and  ability  to  see 
the  total  picture.  It  appears  we  are  not  using  the  most  effective  combination  of  these  qualities. 
An  alternate  system  Is  to  give  the  forecaster  general  synotic  Information  and  let  him  or  her  input 
into  the  computer.  Let  the  computer  build  bulletins  at  proper  times,  add  local  climatology,  and 
answer  standard  query  response  questions.  In  this  setup  the  forecaster  becomes  the  driver  of  the 
system  and  his  capability  has  multiplied.  The  Rank  Input  Method  provides  an  algorithm  for  such  a 
system  which  is  described  in  the  following  section. 
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6.1  The  Probability  Window 

The  concept  of  a  probability  window  is  based  on  the  premise  that  for  the  longer  forecast  times, 
only  the  large-scale  weather  patterns  can  be  forecast.  However,  a  given  larger  scale  weather 
pattern  can  provide  detailed  small-scale  forecasts  if  detailed  small-scale  climatology  is  available. 
Therefore,  the  changable  part  of  the  forecast,  i.e.,  the  part  that  carries  the  forecast  information, 
can  be  represented  on  a  large-scale  grid  with  relatively  few  numbers.  The  large-scale  forecast  can 
be  interpolated  to  any  point  within  the  window.  At  that  point  the  large-scale  value  is  combined 
with  point  climatology  to  produce  a  point  forecast.  A  description  of  a  probability  window  using  an 
objective  input  is  given  in  Boehm  (1977). 

6.2  REFQRGER  7b 


REFORGER  is  an  annual  exercise  that  tests  the  ability  of  US  forces  to  support  NATO  commitments. 
In  addition  to  standard  weather  support,  probability  forecasts  for  visibility  and  combined  ceiling 
visibility  categories  were  required  in  1978.  These  forecasts  were  for  a  rectangular  13  X  10  mesogrid 
in  central  Germany  with  the  southwest  corner  at  49°51*N.  8°20'E  using  a  10 *  east -west  and  7* 

north-south  grid  spacing.  The  four  categories  were:  (1)  ceiling  greater  than  or  equal  to  100  feet 
and  visibility  greater  than  or  equal  to  900  meters;  (2)  visibility  greater  than  or  equal  to  1500 
meters;  (3)  visibility  greater  than  or  equal  to  3000  meters;  and  (4)  ceiling  greater  than  or  equal 
2000  feet  and  visibility  greater  than  or  equal  to  800  meters.  Forecast  projections  were  12,  18,  24, 
30,  36,  48,  and  60  hours.  Separate  subjective  rankings  for  ceiling  and  visibility  were  input  at 
four  macroscale  grid  points  located  near  the  corners  of  the  window.  Thus,  at  a  given  projection 
eight  (4  input  points  X  2  CI6  VSBY)  input  rankings  produced  520  (130  grids  points  X  4  categories) 
probability  forecasts. 

The  procedure  was  verified  1  August  to  30  September  1978  at  four  stations  in  Germany:  Frankfurt 
AB  (EDDF),  Maurice  Rose  AAF  (EDEN),  Fulda  AAF  (EDEX),  and  Kitzingen  AAF  (EDIN).  Brier  scores  for 
Frankfurt  AB,  12-hour  forecast,  category  3  were  RIM  0.30,  climatology  0.37,  sample  climatology  is 
the  frequency  of  the  event  during  the  verification  period,  sample  climatology  0.29.  This  relation 
held  for  all  stations  and  forecast  projections:  the  forecast  showed  a  definite  imporvement  over 
climatology  but  sample  climatology  did  even  better.  Keep  in  mind  sample  climatology  is  only 
completely  known  after  the  fact.  Further  analyst  showed  that  ceiling  and  visibility  was  much  higher 
than  average.  Frankfurt  AB  had  category  4  93  percent  of  the  time  on  the  12-hour  forecast  projection 
compared  to  the  long-term  climatology  of  78  percent.  A  feedback  system  showing  the  accumulated 
ranking  would  have  failed  to  correct  for  this  run  of  good  weather. 

Major  Frank  Globakar  (1978)  who  evaluated  the  method  concluded: 

Capt  (now  Maj)  Boehm's  model  shows  promise.  The  forecaster  did  not  have 
to  be  concerned  with  specific  weather  categories  and  was  able  to  produce 
a  large  number  of  probability  forecasts  with  a  few  simple  rankings  of  the 
of  the  forecast  synoptic  situaiton.  The  method  showed  more  skill  than 
either  persistence  or  climatology  which  are  the  usual  benchmarks  of  skill. 

7.  PROBABILITY  AND  TRIANGULAR  COORDINATES 

Probabilities  for  three  categories  are  often  plotted  on  triangular  coordinate  graph  paper  since 
any  point  in  the  triangle  presents  a  set  of  three  probabilities  that  add  up  to  1.  The  idea  can  be 
extended  to  more  than  three  categories.  For  four  categories  a  three-sided  equilateral  pyramid  is 
used.  Here  we  will  stay  with  three  categories  but  keep  in  mind  the  result  can  be  applied  to  a 
higher  number  of  categories. 

7.1  TRP  In  Triangular  Coordinates 

For  the  internal  category  of  ranked  categories  (category  2  of  3)  the  TRP  equation  is 


P2  -  PrtfHY^2)  *  «-C(J>2-RN)/(l-R2)1/23  -  ♦£Ol-WO/(l-R2)1/2]  (9) 


When  Pi,  and  p3  are  plotted  in  triangular  coordinates  an  interesting  curve  results.  Most 

surprising  Is  that  all  points  fall  on  a  single  curved  line,  not  an  area.  The  line  runs  from  Pi  =  1 
to  P3  8  1,  is  symmetrical  with  respect  to  Pi  and  P3,  and  P2  reaches  a  maximum  value  when  Pi  =  P3. 
The  higher  the  correlation  and  climatological  probability  of  category  2,  the  closer  the  line 
approaches  P2  *  1*  See  Figures  2  and  3.  The  max  P2  Is 


Max  P2  *  2  *{(y2-yi)/[2(l-R2)1/2:i) 


(10) 
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Figure  2.  TRP  Probability  Values  on  Triangular  Figure  3.  Same  as  Figure  1  except  climatology 

Coordinate  Paper.  All  curves  use  climatology  of  of  all  categories  is  one-third.  Forecast  line 

category  (1)  =  0.2,  (2)  =  0.3,  and  (3)  =  0.5.  for  correlation  of  0.3  is  drawn  only  to  N  *  *3. 

Circles  on  the  lines  indicate  N  28  0,  i.e.,  half 

of  the  forecasts  will  be  to  the  left  along  the 

curve,  the  other  half  to  the  right.  A  dash 

across  the  curve  indicates  N  =  ±1,  i.e.,  about 

68  percent  of  the  forecasts  will  lie  between  the 

two  dashes. 


Proof:  maximum  difference  between  the  right-hand  terms  of  Equation  (9)  occurs  with  arguments 
centered  on  0  since  the  slope  is  maximum  at  0  and  the  absolute  value  monotonical ly  decreases  away 
from  0.  To  be  centered  about  0  requires  the  first  argument  to  equal  minus  the  second  argument. 
Solve  this  equation  for  N  and  substitute  Into  Equation  (9).  Then  use  $  (-u)  -  1  -  $  (u). 


Categorical  Forecast  Decision  Rules 


Inspection  of  the  TRP  Equations  (1)  and  (9)  shows  that  aiy  point  on  a  given  curve  is  a  function 
of  the  single  parameter  N.  Keeping  in  mind  the  method  is  limited  to  ranked  categories,  any  decision 
rule,  in  principle,  can  be  rewritten  with  single  parameter  N  or  the  equivalent  vector  (not  matrix^) 
of  critical  probabilities  of  cummulatlve  categories. 


Since  N  is  normally  distributed,  the  expected  frequency  that  forecasts  will  fall  in  any  decision 
region  can  be  calculated.  For  example,  suppose  the  requirement  is  to  have  the  number  of  times  the 
end  category  is  categorically  forecast  equal  to  Itj  observed  frequency.  The  critical  probability 
Pc  for  this  requirement  Is  obtained  by  setting  N  =  y  in  Equation  (1)  giving 


Pc  [Pr(0B)  =  Pr(FCST) ]  -  <*[y (1-R )/ (l-R^ ) 1 /2] 


(11) 


Any  scoring  system  can  be  evaluated  with  a  given  set  of  critical  probabilities  by  simulating 
forecasts  by  varying  N  which  Is  normally  distributed.  For  two  categories  the  expected  frequency 
that  a  forecast  will  be  successful  (S)  for  a  given  critical  probability  Pc  can  be  obtained  by 
integration  of  the  bivariate  normal 

S  *  ${[Pc(l-R2)l/2.y]/Rf  y,  R]  (12) 


This  equation  has  been  used  to  prepare  tables  (Boehm,  1977)  for  various  critical  probabilities 
as  an  aid  to  decision  makers  to  subjectively  determine  a  critical  probability  by  weighing  forecast 
hits  against  misses. 
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8.  CONCLUSIONS 

The  Rank  Input  Method  and  Probability  Variation  Guide*  offer  means  for  multiplying  the 
effectiveness  of  a  weather  forecaster.  They  allow  the  forecaster  to  drive  the  computer  system 
instead  of  the  computer  driving  the  forecaster.  The  single  forecast  index  of  the  TRP  model  is 
usefull  in  many  decision  analysis  problems  and  has  many  potential  applications. 
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Appendi x 

GENERAL  FORM  OF  ERROR  DISTRIBUTIONS 


Let  U  be  an  observed  variable,  e.g.,  visibility,  sky  cover,  etc.  Let  u  be  a  specific  value  of 
U,  e.g.,  thiree  miles  visibility.  Let  y  be  a  climatological  probability  that  U  is  less  than  or  equal 
to  u.  Let  y  be  the  equivalent  normal  deviate  of  y. 

If  the  forecast  probability  (P)  is  given  by  (1),  then  the  change  in  probability  for  a  change  in 
u,  i.e.,  the  forecast  density  (dP/du)  can  be  obtained  by  differentiating  (1)  with  respect  to  u  via 
Leibniz's  theorem  or  differentiation  of  an  intergral: 


(1  -  R2)l/2 


&L 

du 


(Al) 


ii 

When  y  is  given  directly  as  a  function  of  u,  e.g.,  the  lognormal  distribution,  then  the  function  and 
its  derivitive  can  be  inserted  into  (Al)  to  give  the  density Has  a  function  of  u.  When  y  is  given  as 
a  function  of  u,  f(u),  then  approximations  such  as  (4)  give  y  as  a  f unci  ton  of  u: 


y  =  t f(u)-l*  -  [1  -  f(u)]-14}  4.91 


(A2) 


Differentiating  (A2)  with  respect  to  u  gives: 

ii 

=  0.6874  {f(u)-«86  +  [1  -  f (u)]-.86}8fM 
du  du 

Substituting  (A2)  and  (A3)  into  (Al)  gives  the  density  as  a  function  of  u. 


t 


